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Field of the Invention 

The present invention relates to a method and apparatus for 
encoding moving pictures. In particular, the present invention relates to a method 
and apparatus for performing variable bit rate control in a digital video encoder 
while maintaining a particular overall bit-rate. 



Description of the Related Art 

One of the main obstacles faced by industry dealing with digital video 

10 processing, storage and transmission is the large amount of data needed to 
represent analog video in the digital domain. Accordingly, digital compression is 
often applied to moving pictures to achieve a reduction in required transmission 
bandwidth or storage size. One variety of such compression techniques can be 
derived from the ISO/IEC MPEG Standards, the ISO/IEC 11 172-3(MPEG-1), the 

15 ISO/IEC 13818-2 (MPEG-2) and the MPEG-2 TM5 (test model 5), developed by 
the Moving Picture Experts Group of the International Organization for 
Standardization. The disclosures of those standards documents are hereby 
expressly incorporated into this specification by reference. MPEG-1 is the 
compression standard used in VideoCD while MPEG-2 is the video compression 

20 standard used in DVD and many digital broadcasting systems. 

The MPEG standards specify only the syntax of the compressed bit 
streams and method of decoding. The method of implementation in the encoder is 
left to the developer, and any form of encoder may be employed as long as the 
resulting bit stream conforms with the specified syntax. 

25 In certain applications such as video storage device (recorder), it is 

possible to use variable bit rate (VBR) encoding. A VBR encoder is able to vary its 
output bit-rate over a larger range than a CBR encoder, and this would generate 



an output which has a more constant visual quality. An example of a VBR encoder 
is described in United States patent number 5,650,860, entitled "Adaptive 
Quantization". In order to maintain a maximum bit rate allowed by the target 
storage device as well as an overall bit-rate which enables input picture sequence 
5 to be stored into a defined storage space, such VBR encoders utilize multiple 
encoding passes. 

In the first encoding pass, the bit utilization information is determined 
for each scene or each picture in the input sequence. This may be done by fixing 
the reference quantization step size and disabling the VBV control. The 

1 0 determined bit utilization information is then used to generate a bit budget for each 
scene or picture such that an overall target number of bits to code the sequence is 
fixed, and that the maximum bit rate is not violated. In cases that bit utilization 
information obtained is not close to that required for generating the bit budget, 
steps from the first coding pass must be repeated with an adjusted reference 

15 quantization step-size. The input sequence is coded in a final pass using the 
generated bit budget information to achieve the target bits or overall bit rate. 

Multiple-pass VBR encoders requires large storage memory for 
intermediate bit utilization information, and large computation needs for the 
additional passes and bit budget generation. Furthermore, such a VBR encoder 

20 cannot process input sequences in real-time as required by certain applications. 

BRIEF SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a single-pass real- 
time variable bit rate encoder for moving pictures. It is also an object of the 
present invention to provide variable bit rate encoding of moving pictures such that 
25 the change in encoded picture quality from one scene to another is minimized. A 
further object is to provide a real-time variable bit rate encoding algorithm which 
produces a constant overall bit rate. 

In particular, the present invention encodes an input moving pictures 
sequence one segment at a time according to a target encoding quality which is 



determined by a target segment bit rate. The target segment bit rate of a current 
segment is preferably derived from the differences between the target segment bit 
rates and the actual coding bit rates of previous or previous few encoded 
segments. 

5 To maintain consistent encoding quality for all pictures within a 

segment, the actual target bit rate for encoding the pictures is made variable 
according to their scene complexities as well as the target encoding quality of the 
segment. 

As the target encoding quality of each segment is modified based on 
10 the differences between the target and the actual bit rates of previous or previous 
few encoded segments, the change of target encoding quality from segment to 
segment is made relatively smooth compared to that of a Constant Bit-Rate 
Encoder, and furthermore, the overall bit rate of encoding is maintained constant. 

In accordance with the present invention, there is provided a method 
1 5 for use in a moving pictures encoder for encoding a sequence of segments each 
having at least one image, comprising the steps of: 

a) determining an overall target bit rate for encoding the sequence of 

images; 

b) determining a bit allocation and target quantization step size for 
20 encoding a first segment on the basis of a segment target bit rate calculated using 

said overall target bit rate; 

c) encoding said first segment using a variable bit rate encoding 
method according to the target quantization step size; 

d) determining a difference between the number of bits used to 
25 encode said first segment and said first segment bit allocation; 

e) distributing said difference for use in encoding at least one 
subsequent segment to determine a subsequent segment bit allocation; 

f) determining a new target quantization step size for encoding a said 
subsequent segment on the basis of a new target segment bit rate calculated 

30 using said segment target bit rate and the distributed difference; and 



g) encoding said subsequent segment using a variable bit rate 
encoding method according to the new target quantization step size; 

wherein variable bit rate encoding is employed for encoding pictures 
within a segment whilst maintaining a substantially constant bit rate over said 
5 sequence. 

The present invention also provides a method for encoding moving 
pictures in a moving pictures encoder wherein a sequence of images are provided 
as input, the sequence of images comprising a plurality of segments each having a 
plurality of images, the method including: 
10 a) determining an overall target bit rate for encoding the sequence of 

images; 

b) maintaining a distribution record of bits from at least one 
previously encoded segment allocated for use in encoding at least one segment to 
be encoded; 

15 c) determining a target segment bit rate for a segment of the 

sequence of images on the basis of the overall target bit rate and a bit rate change 
calculated from the corresponding allocated bits from the distribution record; 

d) determining a target segment encoding quality from the target 
segment bit rate, a preceding target segment bit rate and a preceding target 

20 segment encoding quality,; and 

e) encoding the images of the segment according to the target 
segment encoding quality using a variable bit rate encoding technique taking into 
account scene complexities of the images in the segment; 

wherein maintaining said distribution record includes determining a 
25 difference between the number of bits used to encode a particular segment and 
the number of bits allocated for encoding the particular segment on the basis of the 
target segment encoding quality. 

According to the current invention, a moving pictures sequence is 
divided into segments. 
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The size of each segment may be suitably determined. Each 
segment is encoded with a target encoding quality derived from its target segment 
bit rate. A variable bit rate (VBR) encoder is utilized to encode the segment 
according to its target encoding quality. 
5 The target segment bit rate of an initial segment is obtained from a 

user defined target overall bit rate. After encoding the segment, the difference 
between the actual bit rate used and the target segment bit rate is obtained. This 
difference is propagated to the next or next few segments to be coded. This 
process is repeated for each segment; therefore, for each subsequent segment, a 
10 new target segment bit rate is determined from the user defined target overall bit 
rate and the differences between the target segment bit rates and actual bit rates 
of previous or previous few segments. 

The present invention further provides a method for controlling bit 
allocation in a moving pictures encoder for encoding a sequence of images 
1 5 comprising a plurality of segments each having a plurality of images, the method 
including, for each segment: 

determining a difference between a number of bits used for encoding 
a previous segment and a number of bits allocated for encoding the previous 
segment; 

20 calculating a bits distribution from the determined bits difference and 

a predetermined distribution function; 

calculating a bit rate change from the bits distribution and a 
predetermined number of images in the segment; 

calculating a target segment bit rate from the bit rate change and a 
25 predetermined target overall bit rate for the sequence of images; and 

determining a target segment encoding quality from the target 
segment bit rate. 

The present invention further provides an encoding quality 
adjustment processor for generating a target segment encoding quality value in a 
30 moving pictures encoder for encoding a series of segments each having at least 

5 
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one image using a variable bit rate encoding scheme whilst maintaining a 
substantially constant overall bit rate, comprising: 

a bits difference computation means coupled to receive a segment 
encoding bit utilization value and a target segment bit rate and generate therefrom 
5 a bits difference value representing a difference in bits allocated and bits used for 
encoding a segment; 

a bits distribution means coupled to the bits difference computation 
means for computing at least one bits distribution value from the bits difference 
value and a predetermined distribution function; 
10 a bit rate difference computation means coupled to the bits 

distribution means for computing a segment bit rate difference from the at least 
one bits distribution value and a predetermined number of images in a segment; 

a target segment bit rate adjustment means coupled to the bit rate 
difference computation means and the bits difference computation means for 
1 5 computing said target segment bit rate from the segment bit rate difference and a 
predetermined target overall bit rate for the sequence of segments; and 

an encoding quality computation means coupled to the target 
segment bit rate adjustment means for computing a target segment encoding 
quality value from said target segment bit rate. 
20 The present invention further provides a moving pictures encoder 

comprising: 

a coding processor for encoding picture data based on macroblocks 
according to a quantization step size; 

a virtual buffer processor coupled to the coding processor for tracking 
25 a number of bits used for encoding successive macroblocks in a picture and a 
number of bits used for encoding successive pictures in a group of pictures; 

a quantization step size processor coupled to the coding processor 
for determining said quantization step size from a target number bits allocated for a 
picture and the number of bits already used for encoding macroblocks in that 
30 picture; 



a picture bit allocation processor coupled to the quantization step 
size processor for determining said target number of bits allocated for a picture 
from a target bit rate and the number of bits already used for encoding pictures in a 
current group of pictures; 
5 a bit rate adjustment processor coupled to the picture bit allocation 

processor, the virtual buffer processor and the quantization step size processor for 
determining said target bit rate from the number of bits already used for encoding 
successive pictures in the current group of pictures, a target encoding quantization 
step size and an average quantization step size for pictures in the current group of 
10 pictures; and 

a target encoding quantization step size processor coupled to the bit 
rate adjustment processor and the virtual buffer processor for determining said 
target encoding quantization step size from a predetermined target overall bit rate 
and the number of bits used for encoding a preceding group of pictures. 



1 5 BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is described in greater detail hereinafter, by way of 
example only, with reference to the accompanying drawings, wherein: 

Figure 1 is a block diagram of the constant bit-rate controller based 

on TM5; 

20 Figure 2 illustrates the difference between a target bit allocation and 

an actual bits consumption at macroblock level for one frame (a P-picture is 
chosen for this case), where dj is the virtual buffer fullness at a particular instance; 

Figure 3 is a block diagram of the variable bit-rate encoder with 
constant overall bit-rate control; 
25 Figure 4 is a block diagram of the constant overall bit-rate controller; 

Figure 5 illustrates how the difference in bit count for the last 
segment is redistributed over the next four segments (e.g., f (m) = 1/4); and 

Figure 6 is a flow chart of a VBR algorithm with constant overall bit- 
rate control. 
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DETAILED DESCRIPTION OF THE INVENTION 

In a standard MPEG compliant video encoder, a sequence of moving 
pictures (e.g., video) is input to the encoder where it is compressed with a user 
defined target bitrate. The target bitrate is set according to the communication 
5 channel bandwidth in which the compressed video is to be transmitted, or the 
storage media capacity in which the compressed video sequence is to be stored. 
A typical MPEG encoder involves motion estimation/prediction, Inter/lntra 
classification, discrete cosine transform (DCT) computation, quantization, zig-zag 
scanning, variable length coding and rate-control. 

10 Several different forms of coding can be employed depending upon 

the character of the input pictures, referred to as l-pictures, P-pictures, or B- 
pictures. The l-pictures are intra-coded pictures used mainly for random access or 
scene update. The P-pictures use forward motion predictive coding with reference 
to previously coded l-or P-pictures (anchor pictures), and the B-pictures use both 

15 forward and backward motion predictive/interpolative coding with reference to 
previously coded -or P-pictures. Furthermore, a group of pictures(GOP) is formed 
in encoded order starting with an l-picture and ending with the picture before the 
next l-picture in the sequence. 

The pictures are partitioned into smaller and non-overlapping blocks 

20 of pixel data called macroblocks (MBs) before encoding. Each from a P-or B- 
picture is subjected to a motion estimation process in which forward motion 
vectors, and backward motion vectors in the case of a B-picture MB, are 
determined using reference pictures from a frame buffer. The target macroblock in 
the current picture is matched with a set of displaced macroblocks in the reference 

25 picture, the macroblock that best matches the target macroblock is used as the 
predicted macroblock. The position of this predicted macroblock is specified by a 
set of vectors known as motion vectors, which describe the vertical and horizontal 
displacement between the target and predicted macroblock. For a B-picture, the 
process is similar except there are two reference pictures: anchor pictures 

30 immediately preceding and following the B-picture. 



With the determined motion vectors, motion compensation is 
performed where the intra-or inter-picture prediction mode of the MB is first 
determined according to the accuracy of the motion vectors found, followed by 
generating the necessary predicted MB. l-pictures are always intra coded while for 
5 P and B-pictures, a decision on whether intra or inter coding will be used (at 
macroblock level) is made depending on which method will give rise to a more 
efficient coding. 

Transformation of the macroblock using a DCT is then carried out on 
the 8x8 pixel blocks within the macroblock. For intra coding, the actual picture 

10 data is coded, while for inter coding the prediction error is coded. This is followed 
by a quantization process of the DCT coefficients which involves a quantization 
matrix and a quantization step size. The quantized coefficients are then run-length 
encoded with variable length codes. 

The resultant bit usage and statistical data are passed onto a rate 

15 control module, which is used for allocating a target number of bits used to encode 
each picture and each macroblock within the picture. This is an important module 
in the encoder as it plays a major role in maintaining the quality of the encoded 
picture. 

The rate control in MPEG-2 Test Model 5 (TM5) comprises the steps 
20 of allocating the target amount of bits for coding each picture, deriving the 

reference quantization parameter (Q y ) to be used on each macroblock in a picture, 
and modulating the Qy based on the activity masking level of the surrounding 
blocks of the corresponding macroblock to obtain the modulated quantization step 
size (Mquant) used to quantize the macroblock. 
25 An objective of this rate controller is to ensure all pictures maintain a 

similar level of quality. It assumes that the subjective quality of a single coded 
picture can be qualified with a single number V, described by factor K/Q where K is 
a constant particular to a picture type and Q is the quantization step size for the 
picture being coded. That is, the rate controller will try to preserve the following 
30 equation: 



y = K i ^ K P = K b 

Qi Q P Q b 

The subscripts /, p, b refers to I, P, and B-picture types. The 
parameters K h K p , and Kb are experimentally determined constants for all I, P, and 
B pictures respectively, and K) usually is normalized to the value of 1. 
5 Figure 1 illustrates a block diagram of a known embodiment of the 

TM5 rate controller. A group-of-pictures (GOP) is a collection of one I picture, 
some P pictures and B pictures, and serves as a basic access unit with the I 
picture as the entry point to facilitate random access. The controller comprises 
three levels of processing: a GOP level 1 1 1 , a picture level 1 1 0, and a macroblock 
10 (MB) level 109. At the start of every new GOP at the GOP level 1 1 1 , a GOP bit 
allocation process 100 computes the total number of bits (R g0 p) allocated for the 
GOP. The value of R gop is given by: 

bit rate x N /A 
R o = — = (1 

gop picture _rate 

where 

1 5 bit_rate is the target bit rate for encoding the picture 

sequence, 

picture-rate is the number of pictures coded per second, and 
N is the total number of pictures coded in the GOP. 

The remaining bits {R) for the GOP is determined at the picture level 
20 1 10 by a remaining bits determination process 106. The value of R is updated as: 

R = R-S where S is the number of bits used by previously 

coded pictures in the GOP, and 
R = R +R gop for a new GOP. 

With the computed value of f?, a picture bit allocation process 101 at 
25 the picture level 1 10 then computes a target bit value (7) allocated to the current 
picture according to the equations below for corresponding I, P or B picture type: 
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where 



T ; = max 



R 



bit rate 



N p X p N b X b ' K x picture _rate 
X i K b X i K b 



(2) 



T p = max 



R 



bit rate 



N b K p X b ' K x picture _ rate 



T b = max 



R 



(3) 



bit rate 



N p K b X p * K x picture _rate 

Jy . + 



b p 



(4) 



Tj, T p , Tb 
N b , N p 

Xh Xb 



Represents target bits for the next picture (I, P, B) 
Represents the remaining number of pictures (P, B) in 
the GOP 

Represents global complexity measures and gives a 
measure of the actual number of bits required to 
10 represent the picture (without compression), and 

Xi = Qi x Si 
Xp — Qp x Sp 
X b = Q b x S b 

Si, S p , Sb are the actual number of bits used to code 
1 5 the previous l/P/B frame. 

K s a constant (e.g., 8) 

After computing the target bits for the picture, a reference 
quantization step-size(Qy) computation 102 is carried out at the MB level 109 for 
each macroblock within the picture. The value of Q, is computed based on the 
20 determined target bit allocation (7), 7" p , or Tb) and a buffer fullness value (dj). Each 
of the three picture types has a virtual buffer associated with it and these buffers 
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are updated after coding each macro block of the picture type according to the 
following equations: 

r, *0'-i) 



d J =d'+B J _ l - 



d>=d>+B H -~ p 



d)=dl+B hX - 



MB_Cnt 

T p x{j-\) 
MB_Cnt 

T b x{j-l) 
MB Cnt 



(5) 
(6) 
(7) 



where 



do 
di 



Bj-i 



is the initial virtual buffer fullness, 
is the virtual buffer fullness when coding f h MB, 
is the actual bits consumed up to and including (J - 1) th 
10 MB as provided by a virtual buffer update process 

105, and 

MB_Cnt is the number of MBs in the picture. 

Equations(S) to (7) effectively track the differences between the 
actual number of bits used and the target bits allocated, and these differences are 
15 then added to the respective virtual buffers which are used to compute Qj. This 
allows the rate controller to control the bits allocation based on the consumption 
pattern of the picture Qj is computed from the buffer fullness via the equation: 

Qj - (dj x 31 ) / r = dj x constant (8) 

where r is a reaction parameter = 2 x bitjrate I picture_rate 

20 The reaction parameter, as the name implies, is a factor that can 

control the sensitivity of the algorithm from a change dy. Large r will cause the 
reaction to be slower, which may cause the target bits and actual bits to differ 
significantly, but it brings about a more gradual rate of change in bit consumption 
which is favorable for scene changes. On the other hand, a small r causes the 

25 controller to be more reactive to changes, giving rise to a closer target and actual 
bits value, but a faster response also means a less gradual change during scene 

12 



change which is not desirable. A graph illustrating the difference between the 
target bit allocation and the actual bits allocated within a frame is shown in Figure 
2. 

The value of Q y is further modified by an activity masking process 103 
5 to give the modulated quantization step size, Mquant Basically, the spatial 
masking ability of the macroblock is used to place a correction factor onto the 
computed Qy value. Spatial masking ability is the ability to contain noise masked 
from human visual systems. Typically, an area with complex texture will have a 
larger correction factor than an area with simple texture, hence, 
1 0 Mquant =A/_acf y x Q y (9) 

where N_actj, is the normalize value of macroblock activity level and may have a 
value of between 0.5 and 2 

With the computed Mquant, the macroblock coding process 104 is 
performed where the macro block is compressed according to the MPEG 
15 standards. 

At the end of coding a picture, the bit count S used for coding the 
picture is used to perform a VBV check 107 and to compute the remaining bits R 
for the current GOP at the remaining bits determination process 106. The VBV 
(video buffer verifier) is a virtual buffer emulating the status of the decoding buffer. 

20 The VBV check(107) is performed to detect any overflow or underflow of the 
decoding buffer with reference to the target bit rate. 

When coding a picture sequence with varying scene complexity, a 
constant bit rate (CBR) encoder is unable to vary the bit-rate according to the 
scene complexity. This results in scenarios whereby simple scenes are allocated 

25 more bits than required while complex scenes are allocated insufficient bits, and 
therefore results in a variation of output visual quality between scenes of different 
complexities. 

An embodiment of a moving pictures encoding apparatus according 
to the present invention is illustrated in block diagram form in Figure 3. The 
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encoding apparatus as illustrated is arranged in four levels corresponding to 
processing stages in the bit rate control process, namely a segment level 313, a 
GOP level 312, a picture level 31 1 and a macroblock (MB) level 310. An input 
moving pictures sequence is encoded in such a way that on the average, or at the 
5 end of encoding, the output bit rate is close to a user definable target overall bit 
rate (overallJBR) 320. The input sequence is divided into segments and groups of 
pictures (GOP). The purpose of a segment is for monitoring of output bit rate with 
reference to the overall_BR, and the purpose of the GOP is primarily to facilitate 
random access. A segment may be defined to include a number of GOP(s) 

1 0 depending on the need in terms of frequency of monitoring output bit rate. Each 
GOP comprises at least an l-picture and optionally one or more P-pictures and/or 
B-pictures. Each picture is divided into macroblocks of pixels for encoding. 
Hence, a method of rate control according to the current invention may contain 
processes at the segment level 313, GOP level 312, picture level 31 1 , and finally 

15 macroblock level 310. 

With a new group of pictures, a GOP bit allocation processor 300 
computes the number of bits allocated to the current GOP(F? r gop ) as follows: 

R'=BRx ^ (10) 

picture _rate 

where: 

20 Ngop is the number of pictures in the GOP, 

picture jrate is the number of pictures coded per second, and 
BR is a target bit rate. 

The target bit rate BR is determined for each picture or plurality of 
pictures by a bit rate adjustment processor 306 which is described hereinbelow. A 
25 remaining number of bits (R') for the GOP is determined at the picture level 31 1 by 
a remaining bits determination processor 308, which carries out the steps of: 

(a) before encoding the first picture in a GOP, adjusting R' with a 

new R'gop\ 

14 



/?'+= R'gop, and then set SR 0 w, = BR 

(b) otherwise if the picture to be coded is not the first picture of a 
GOP, and a new target bit rate BR is determined, then adjusting R' with the new 
BR according to: 

5 R'+ = — — x (BR - BR old ), and then set BR old = BR 

picture _rate 

where N is remaining number of pictures in the current GOP 

(c) and removing number of bits used by the last coded picture S 
from the remaining bits value R\ hence: 

R' = R'-S 

10 The computed R' is then passed to a picture bit allocation processor 

301 to compute the target bit (7) allocated to a current picture to be coded. For 
example, equations (2), (3) and (4) described hereinabove may be used for that 
purpose, with the parameters K pt K b , A/ p , N b and pictjype being supplied. Note 
that the lower limit of bitjrate {8*picture_rate) is only optional and may be adjusted 

15 if necessary. 

At the macroblock (MB) level 310, a reference quantization step-size 
computation processor 302 computes a reference quantization step-size Q, for 
each MB using the computed target bit allocation T and a bit utilization parameter 
B determined by a virtual buffer update processor 305. An example of a method 
20 for computing Q, for the f MB of the current picture is represented by: 

K } x D. x picture rate tjt x 

Q =— 1 — = (11) 

If v DD V ' 



K 2 xBR 



where 



Ki and K2 are constants (e.g., 31 and 2 respectively), 

BR is the determined target bit rate, 

25 picturejrate is the number of coded pictures per second, and 

Dj is a virtual buffer fullness 
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The virtual buffer fullness of the /* MB is determined by: 

K* xD } x picture rate 

Q j = — rJ— = 12 

* j K 2 xBR 

where 

D 0 is an initial buffer fullness before coding the current 

5 picture, i.e., Dj at end of the previous picture, 

B is the bit utilization information supplied by the virtual 

buffer update processor 305, i.e., bits used to code 
the 1 st MB to {j'ks) th MB, 
MBJCnt is the total number of MB in the current picture, and 
10 K z is a constant (e.g. , 1 ). 

Three independent current and initial virtual buffers fullness values 
(Dj , Df , Dj , D' Q , D£ , Dq ) may be maintained for the three picture types (I- 
pictures, P-pictures, and B-pictures). The computed Qj may also be further scaled 
by an activity masking processor 303 according to a masking factor determined by 

1 5 the surrounding activity levels of the MB to form the final quantization step-size, 
Mquant, for coding the current MB. A method of masking factor determination can 
be found in the aforementioned MPEG-2TM5. 

Macroblock coding (304) is then performed to encode the current 
MB. A macroblock coding processor 304 may employ, for example, methods 

20 according to the MPEG-1 or MPEG-2 video encoding standards. Such encoding 
methods include necessary motion compensation, discrete cosine transform, 
quantization with the determined quantization step-size, and run-length encoding 
with variable length codes. The methods also include necessary decoding 
processes such that motion compensation can be performed. The number of bits 

25 utilized by the macroblock coding processor 304 to code each MB is passed to the 
virtual buffer update processor 305. 

In applications where a maximum and/or minimum bit rate must be 
maintained for encoding, a VBV checking processor 307 is utilized. User definable 
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maximum and/or minimum bit rates (max/min BR) are input at 321 to the VBV 
checker 307, a method of VBV checking according to the MPEG standards is used 
to examine bit utilization information from the virtual buffer 305, and necessary 
corrections are made to ensure compliance and that the output bit rate is within the 
5 defined max/min BR. 

At the end of coding a picture or a plurality of pictures, bit rate 
adjustment is applied by the bit rate adjustment processor 306 to provide any 
necessary correction to the current target bit rate BR. The correction is based on a 
target encoding quality provided by a target segment quality adjustment processor 

10 309 and a resultant encoding quality of previously coded picture(s) such that the 
overall encoding qualities of pictures within a segment are relatively close to the 
target constant. The resultant encoding quality may be determined by the average 
value of the reference quantization step size (i.e., average Qy) of the previously 
coded picture(s). A method of target bit rate (BR) determination can be derived 

15 from a rate-quantization model as given by: 

np , nn . K 4 x (average _Qj -target _Q ) 

BR = current _BR + (13) 

average _Q j 

where 

current_BR is a current estimated bit rate, or 
current BR = S s + N p x S p + N b x S b , 
20 S/ ( S p , S b are bits used by previously coded I, P, and B-pictures 

respectively, 

A/ p , N b are total number of P and B-pictures in the current 

GOP, 

average_Qj is the average value of Qy of previously coded 
25 picture(s) 

target_Qj is the target encoding quality or target value of Qy, 
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Ka may be a constant, or a factor of previous BR, 

max_BR, or current_BR, and K, may also be 
separately determined for I, P, and B-picture types. 

Maximum and/or minimum bit rates can be applied to the determined 
5 BR according to application requirements. The max/min BR input at 322 are used 
as given by: 

if (BR > max_BR), then BR = maxJBR 
if (BR > min_BR), then BR = minJBR 

It is also possible to make use of the target_Qj at the macro-block 
10 level to increase efficiency of encoding. Basically, the target_Qj is used by the 
reference quantization step-size computation processor 302 as a lower limit of the 
output reference quantization step-size Q, such that when this target (quality) is 
reached at the MB level, bits are saved for future encoding immediately. In this 
case, the output reference quantization step size may be set according to: 

15 if (Qj< target _Qj), then Qj= target _Qj. 

The target bit rate BR is adjusted continuously with reference to a 
fixed target encoding quality within each segment. In turn at the beginning of each 
segment, the target encoding quality is adjusted so that a target overall bit rate is 
achieved. A segment may contain a few groups of pictures depending on the need 
20 in terms of frequency of monitoring output bit rate. A new segment may also be 
defined by a scene change detectable by any conventional scene change detector 
with a given range of time. 

At the onset of a new segment, the target segment quality 
adjustment processor 309 executes a check on the overall bit-rate based on bits 
25 usage and makes any necessary adjustments to the target encoding quality for the 
bit rate adjustment processor 306 so that the overall bit-rate converges to a user 
definable target overall bit-rate 320 (overall^BR). 

An example embodiment of a constant overall bit rate controller 
implementing the target segment quality adjustment process according to the 
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present invention is illustrated in block diagram form in Figure 4. The bit count 
difference between the actual and target bit counts (bits_diff) for coding of a 
previous segment is calculated by a bits difference computation processor 401 , 
utilizing the steps of: 
5 a) obtaining the actual bits (bits_segment) used for coding the 

previous segment according to the input bit utilization of pictures (S) 400 in the 
previous segment, and 

b) computing the value of bits_diff based on a corresponding target 
segment bit rate 406 (segment_BR) for the previous segment, according to: 

segment BR x N„ om , nt 

1 0 bits_diff = — = =!- - bits _segment (14) 

picture _rate 

where N segmen t is the number of coded pictures in the previous segment. 

The bit count difference(b/fs_d/7i0 is then redistributed by a bit 
difference distribution processor 402 over the next k number of segments using a 
bit distribution function, f(m), expressed as: 

15 delta _bits m = f(m) x bits _diff 

where 

m = l,...,k, 

deltajDitSm is the bits difference distributed to next m th segment. 

20 For an example case of k = 4 and f(m) = 1//c, Figure 5 illustrates how 

the bit count difference {bits_diff) of an encoded segment is distributed over the 
next 4 segments. The period of k segments used for bits compensation is referred 
to as an adjustment interval. A delta bit-rate computation processor 403 (Figure 4) 
accumulates all bit differences {delta_bits) distributed from previously encoded 

25 segments to the current segment to be coded, and computes a delta segment bit- 
rate (&segment_BR) based on the accumulated deltajbits and the number of 
pictures in the current segment to be coded. The delta segment bit-rate and a 
user definable target overall bit-rate (overall_BR) input at 408 are used by the 
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target segment BR adjustment processor 405 to derive the target segment bit rate 
(segment_BR) 406 for the current segment. The value of segment_BR can be 
expressed as: 

segmenet _ BR = overall _ BR -s- Asegment _ BR (16) 

5 A new target encoding quality (target_Qj) 407 for encoding of the 

current segment is then determined from the target segment bit rate using a BR-Qj 
modeling processor 404. The BR-Qj modeling processor may operate, for 
example, according to: 

_ {segmenet BR- segment BR') 

AQj =target_Q'j x K * ~ f— = 1 (17) 

J K $ x segment _ BR 

10 target _Qj = t arget _Q) + AQ } 

where 

target_Q) is the target Qj of the previous segment, 
segment_BR' is the target segment bit rate of the previous 
segments, and 

15 K$ is a constant which can be experimentally 

determined. 

A maximum limit may be placed on the determined AQ, so that there 
is no drastic change in output quality from one segment to another. 

A flow chart for a process of variable bit rate (VBR) encoding with 

20 constant overall bit rate control according to an embodiment of the invention is 
illustrated in Figure 6. Initialization of predetermined parameters is first carried out 
at step 600. Then, at step 601 the process determines whether processing is at 
the start of a new group-of-pictures (GOP) or not. Where a new GOP is 
determined, step 602 is carried out by updating the bit allocation for the new GOP. 

25 This may involve, for example, computation of the GOP bit allocation R' gop 
according to Equation (10) described hereinabove and accumulating to the 
remaining bit value R' to give the R' for the new GOP. 
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For each picture in the current GOP, a target bit allocation value T is 
determined at step 603, and this may be computed according to Equations (2), (3) 
and (4), for example. Then, for each macroblock in the picture, a reference 
quantization step size Q is determined using the computed target bit allocation T, 
5 such as by the process represented by Equations (11) and (12). Activity masking 
may also be included in this step. The macroblock of the picture is then encoded 
at step 605 using the computed quantization step size, and steps 604 and 605 are 
repeated for all of the macroblocks in the picture until the end of the picture is 
determined at step 606. 
10 When the end of the sequence of moving pictures which is being 

encoded is reached, this is determined at step 607, which terminates the process 
at step 61 1 upon that occurrence. Otherwise, the process continues to step 608 
where it is determined whether the end of the current segment of pictures has 
been reached. 

1 5 When the end of a segment of pictures is reached and processing of 

a new segment about to begin, the target encoding quality is adjusted at step 609. 
This is performed by firstly computing the difference between the number of bits 
allocated for coding the previous segment and the actual number of bits used in 
coding that segment, such as by the process represented by Equation (14). This 

20 quantity, bitsjdiff, represents extra bits which are left over from the previous 
segment of pictures, and may be distributed for use in encoding the ensuing 
picture segments. The left over bits bits_diff are distributed for use over one or 
more segments according to a bit distribution function, an example of which is 
described in connection with Equation (15). In essence, the left over bits from the 

25 previous segment are divided into a plurality of k, preferably equal, amounts and 
allocated to the following k segments for encoding with. The dividends allocated 
from all of the k previously processed segments are accumulated and used to 
determine a change in segment bit rate, &segment_BR, according to the 
accumulated bits difference and the number of pictures in the segment to be 

30 coded. This can then be used to compute an allocated bit rate for the segment to 



be coded according to Equation (16).Finally, a target encoding quality, target_Q jt 
can be computed according to Equations (17) and (18), for example. 

If the end of a segment has not been reached at step 608, or after 
the target quality adjustment of step 609, the target quality and the actual quality of 
5 encoded pictures is compared, and a new bit rate for encoding the next picture is 
computed based on the difference. This step can be carried out as described in 
relation to Equation (13), for example. The procedure then returns to step 601 , 
described above. 

In summary, embodiments of the present invention provide methods 

10 and apparatus for encoding moving pictures with a variable bit rate whilst 

maintaining a consistent output encoding quality according to a determined target 
quality. The target quality is adjusted at the beginning of every moving pictures 
segment according to a bit distribution function to ensure an overall bit rate that 
meets a defined target. Furthermore, the segment based bit distribution function 

15 provides smooth and flexible modulation of the encoding quality. The method can 
be implemented at low cost and with the ability to encode an input moving pictures 
sequence in one-pass at real-time. 

It will be readily apparent to those of ordinary skill in the art that the 
foregoing detailed description of the present invention has been presented byway 

20 of example only, and is not intended to be considered limiting to the invention as 
defined in the claims appended hereto. In particular, it is envisaged that numerous 
variations to the embodiments as described can be made without departing from 
the spirit and scope of the invention. 

Throughout this specification and the claims which follow, unless the 

25 context requires otherwise, the word "comprise", and variations such as 

"comprises" and "comprising", will be understood to imply the inclusion of a stated 
integer or step or group of integers or steps but not the exclusion of any other 
integer or step or group of integers or steps. 
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GLOSSARY 
video buffer verifier 
variable bit rate 
constant bit rate 
Moving Picture Experts Group 
discrete cosine transform 
group of pictures 
macroblock 

quantisation parameter (picture quality) 
quantisation parameter for macroblock j 
quantisation parameters for I, B and P type pictures 
number of bits allocated for encoding a group of pictures 
target bit rate for a picture sequence 
number of pictures coded per second 
total number of pictures in the group of pictures 
remaining number of P, B pictures in the group of pictures 
remaining number of bits for encoding a group of pictures 
number of bits already used in coding a group of pictures 
target number of bits for coding next I, P, B type picture 
global complexity measures for I, P, B type pictured 
virtual buffer fullness when coding the j 01 macroblock 
initial virtual buffer fullness 

actual bits used up to and including(/-1) th macroblock 

number of macroblocks in the picture 

reaction parameter (= 2 x bit_rate / picture jrate) 

normalised macroblock activity level 

modulated quantisation step size (= N_actjx Qj) 

definable target overall bit rate 

target bit rate 
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